A Histogram-Based Approach to Mathematical Line Segmentation

Authors

  • Mohamed A. Alkalai
  • Volker Sorge
Abstract

In document analysis, line segmentation is a necessary prerequisite for further analysis of textual components. While much work has been devoted to line segmentation of regular text documents, this work cannot easily be adapted to documents that contain specialist components such as tables or mathematical expressions. In this paper we concentrate on a line segmentation technique for documents containing mathematical expressions, which, due to their two-dimensional structure, often comprise multiple distinct lines. We present an approach to line segmentation in the presence of mathematics that is based on a set of histogram measures and heuristics considering only the vertical and horizontal distances between characters. The method also provides a technique to distinguish consecutive lines that overlap vertically but belong to different mathematical expressions. Experiments on data sets of 200 and 1000 maths pages, respectively, show a high rate of accuracy.
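To make the general idea of histogram-based line segmentation concrete, the following is a minimal sketch of the classic horizontal projection profile, which is a common baseline and not the method of the paper. It assumes character bounding boxes are already available as `(left, top, right, bottom)` tuples in pixel coordinates inside the page; the `Box` format and the function name `segment_lines` are hypothetical choices made for this illustration.

```python
from typing import List, Tuple

# Hypothetical box format for this sketch: (left, top, right, bottom) in pixels.
Box = Tuple[int, int, int, int]


def segment_lines(boxes: List[Box], page_height: int) -> List[List[Box]]:
    """Group character boxes into lines with a horizontal projection histogram."""
    # Histogram: number of boxes covering each pixel row of the page.
    hist = [0] * page_height
    for _, top, _, bottom in boxes:
        for y in range(max(top, 0), min(bottom, page_height)):
            hist[y] += 1

    # Maximal runs of non-empty rows are treated as candidate line bands.
    bands = []
    start = None
    for y, count in enumerate(hist):
        if count > 0 and start is None:
            start = y
        elif count == 0 and start is not None:
            bands.append((start, y))
            start = None
    if start is not None:
        bands.append((start, page_height))

    # Assign each box to the band that contains its top edge.
    lines: List[List[Box]] = [[] for _ in bands]
    for box in boxes:
        for i, (lo, hi) in enumerate(bands):
            if lo <= box[1] < hi:
                lines[i].append(box)
                break

    # Sort each line left to right (tuples compare on `left` first).
    return [sorted(line) for line in lines]
```

This baseline separates lines only where an empty pixel row lies between them, so two lines that overlap vertically collapse into a single band. That is the case the abstract says requires additional histogram measures and distance heuristics.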

Similar resources

Activity Based Video Content Trajectory Representation and Segmentation

A novel approach is developed to segment continuous CCTV recordings according to the activities captured in the scene. This approach differs from previous approaches which are mostly based on shot change detection and shot grouping. Video content is represented by constructing a cumulative multi-event histogram over time. An on-line segmentation algorithm is then proposed to detect breakpoints ...

Texture image segmentation using a new descriptor and mathematical morphology

In this paper we present a new texture descriptor based on the shape operator defined in differential geometry. Then we describe the texture feature analysis process based on the spectral histogram. After that we describe a new algorithm for texture segmentation using this descriptor, statistics based on the spectral histogram, and mathematical morphology. Many results are presented to illustra...

Performance Analysis of Segmentation of Hyperspectral Images Based on Color Image Segmentation

Image segmentation is a fundamental technique in image processing, and its use depends on the user’s application. This paper proposes an original and simple segmentation strategy based on the EM approach that resolves many informatics problems about hyperspectral images which are observed by airborne sensors. In a first step, to simplify the input color textured image into a color image without tex...

Mathematical Morphology Approach for Enhancement Digital Mammography Images

Mammography is a branch of radiology which could benefit greatly from the assimilation of digital imaging technologies. Computerized enhancement techniques could be used to ensure optimum presentation of all digital clinical images. In this research, two enhancement algorithms are proposed that are based on the mathematical morphology theory. The first proposed algorithm deals with the contrast...

A Pixon-based Image Segmentation Method Considering Textural Characteristics of Image

Image segmentation is an essential and critical process in image processing and pattern recognition. In this paper we propose a texture-based method to segment an input image into regions. In our method an entropy-based texture map of the image is extracted, followed by a histogram equalization step to discriminate different regions. Then with the aim of eliminating unnecessary details and achi...

Publication date: 2013